
Supervising the Transfer of Reasoning Patterns in VQA

Neural Information Processing Systems

We proceed along the lines of the proof of Theorem 5.1 in [11]. Given a set of i.i.d. data samples, we first introduce some notation. From the proof of Theorem 5.1 in [11], we also obtain the required property of H, which finishes the proof for the case p = 1. We then consider the case p = 2l + 1 and complete the proof for that case as well. Finally, we provide more details on the program decoder architecture.


Slim Scheduler: A Runtime-Aware RL and Scheduler System for Efficient CNN Inference

Harshbarger, Ian, Chidambaram, Calvin

arXiv.org Artificial Intelligence

Most neural network scheduling research focuses on optimizing static, end-to-end models of fixed width, overlooking dynamic approaches that adapt to heterogeneous hardware and fluctuating runtime conditions. We present Slim Scheduler, a hybrid scheduling framework that integrates a Proximal Policy Optimization (PPO) reinforcement learning policy with algorithmic, greedy schedulers to coordinate distributed inference for slimmable models. Each server runs a local greedy scheduler that batches compatible requests and manages instance scaling based on VRAM and utilization constraints, while the PPO router learns global routing policies for device selection, width ratio, and batch configuration. This hierarchical design reduces search space complexity, mitigates overfitting to specific hardware, and balances efficiency and throughput. Compared to a purely randomized task-distribution baseline, Slim Scheduler achieves a range of accuracy-latency trade-offs: for example, a 96.45% reduction in mean latency and a 97.31% reduction in energy usage when accuracy is dropped to that of the slimmest available model (70.3%). Alternatively, it can reduce average latency and energy consumption while increasing accuracy, at the cost of higher standard deviations in latency and energy, which affects overall task throughput.
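The local greedy step described in the abstract is easy to sketch. Below is a minimal, illustrative Python version of batching compatible requests under a VRAM budget; the `Request` fields, VRAM estimates, and batch limit are assumptions for the sketch, not the paper's actual interfaces, and the PPO router would sit above this layer, deciding which server's queue each request enters.

```python
# Hypothetical sketch of a local greedy batching scheduler for slimmable
# models: group requests that share a width ratio until the VRAM budget
# or batch-size limit is hit.
from dataclasses import dataclass

@dataclass
class Request:
    width_ratio: float   # slimmable width requested (e.g., 0.25, 0.5, 1.0)
    vram_mb: int         # estimated VRAM footprint of this request

def greedy_batch(queue, vram_budget_mb, max_batch=8):
    """Return (batch, remaining): one batch of compatible requests plus
    everything that didn't fit."""
    if not queue:
        return [], queue
    width = queue[0].width_ratio          # batch around the oldest request
    batch, rest, used = [], [], 0
    for r in queue:
        compatible = r.width_ratio == width
        fits = used + r.vram_mb <= vram_budget_mb and len(batch) < max_batch
        if compatible and fits:
            batch.append(r)
            used += r.vram_mb
        else:
            rest.append(r)
    return batch, rest

queue = [Request(0.5, 300), Request(0.5, 300), Request(1.0, 900)]
batch, remaining = greedy_batch(queue, vram_budget_mb=1024)
print(len(batch), len(remaining))  # -> 2 1
```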



Efficient Finetuning for Dimensional Speech Emotion Recognition in the Age of Transformers

Sampath, Aneesha, Tavernor, James, Provost, Emily Mower

arXiv.org Artificial Intelligence

Accurate speech emotion recognition is essential for developing human-facing systems. Recent advancements have included finetuning large, pretrained transformer models like Wav2Vec 2.0. However, the finetuning process requires substantial computational resources, including high-memory GPUs and significant processing time. As the demand for accurate emotion recognition continues to grow, efficient finetuning approaches are needed to reduce the computational burden. Our study focuses on dimensional emotion recognition, predicting attributes such as activation (calm to excited) and valence (negative to positive). We present various finetuning techniques, including full finetuning, partial finetuning of transformer layers, finetuning with mixed precision, partial finetuning with caching, and low-rank adaptation (LoRA) on the Wav2Vec 2.0 base model. We find that partial finetuning with mixed precision achieves performance comparable to full finetuning while increasing training speed by 67%. Caching intermediate representations further boosts efficiency, yielding an 88% speedup and a 71% reduction in learnable parameters. We recommend finetuning the final three transformer layers in mixed precision to balance performance and training efficiency, and adding intermediate representation caching for optimal speed with minimal performance trade-offs. These findings lower the barriers to finetuning speech emotion recognition systems, making accurate emotion recognition more accessible to a broader range of researchers and practitioners.
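The recommended recipe, finetuning only the final three transformer layers in mixed precision, is straightforward to express in PyTorch. A minimal sketch, assuming a CUDA device and the Hugging Face `Wav2Vec2Model`; the regression head, mean pooling, learning rate, and dummy batch are illustrative assumptions, not the paper's exact setup:

```python
# Partial finetuning of Wav2Vec 2.0 in mixed precision: freeze everything
# except the last three transformer layers, then train under autocast.
import torch
from transformers import Wav2Vec2Model

model = Wav2Vec2Model.from_pretrained("facebook/wav2vec2-base").cuda()
for p in model.parameters():
    p.requires_grad = False
for layer in model.encoder.layers[-3:]:      # unfreeze final 3 layers
    for p in layer.parameters():
        p.requires_grad = True

# Illustrative head predicting two dimensions (activation, valence).
head = torch.nn.Linear(model.config.hidden_size, 2).cuda()
params = [p for p in model.parameters() if p.requires_grad]
opt = torch.optim.AdamW(params + list(head.parameters()), lr=1e-4)
scaler = torch.cuda.amp.GradScaler()

wav = torch.randn(4, 16000, device="cuda")   # dummy 1-second batch @ 16 kHz
with torch.cuda.amp.autocast():              # mixed-precision forward pass
    hidden = model(wav).last_hidden_state    # (B, T, H)
    pred = head(hidden.mean(dim=1))          # pooled dimensional prediction
    loss = torch.nn.functional.mse_loss(pred, torch.zeros_like(pred))
scaler.scale(loss).backward()                # scaled backward for fp16 safety
scaler.step(opt)
scaler.update()
```

Caching would go one step further by precomputing and storing the frozen layers' outputs so that only the unfrozen tail runs each epoch.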


Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed

Prutsch, Alexander, Bischof, Horst, Possegger, Horst

arXiv.org Artificial Intelligence

For efficient and safe autonomous driving, it is essential that autonomous vehicles can predict the motion of other traffic agents. While highly accurate, current motion prediction models often demand substantial training resources and are challenging to deploy on embedded hardware. We propose a new efficient motion prediction model, which achieves highly competitive benchmark results while training in only a few hours on a single GPU. Due to our lightweight architectural choices and our focus on reducing the required training resources, our model can easily be applied to custom datasets. Furthermore, its low inference latency makes it particularly suitable for deployment in autonomous applications with limited computing resources.


Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming

Kim, Jinuk, Jeong, Yeonwoo, Lee, Deokjae, Song, Hyun Oh

arXiv.org Artificial Intelligence

Recent works on neural network pruning advocate that reducing the depth of the network is more effective at reducing run-time memory usage and accelerating inference latency than reducing the width of the network through channel pruning. In this regard, some recent works propose depth compression algorithms that merge convolution layers. However, the existing algorithms have a restricted search space and rely on human-engineered heuristics. In this paper, we propose a novel depth compression algorithm that targets general convolution operations. We propose a subset selection problem that replaces inefficient activation layers with identity functions and optimally merges consecutive convolution operations into shallow equivalent convolution operations for efficient end-to-end inference latency. Since the proposed subset selection problem is NP-hard, we formulate a surrogate optimization problem that can be solved exactly via two-stage dynamic programming within a few seconds. We evaluate our methods and baselines with TensorRT for a fair inference latency comparison. Our method outperforms the baseline method with higher accuracy and faster inference speed on MobileNetV2 and the ImageNet dataset. Specifically, we achieve a $1.41\times$ speed-up with a $0.11$\%p accuracy gain on MobileNetV2-1.0 on ImageNet.
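The merge step rests on a standard identity: once the activation between two convolutions is replaced by the identity function, the pair collapses exactly into a single convolution whose kernel is the full convolution of the two kernels. A minimal PyTorch sketch of that identity for stride-1, bias-free layers; this illustrates the merge itself, not the paper's dynamic-programming selection:

```python
# Merging conv -> identity -> conv into one equivalent convolution.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
c0, c1, c2, k1, k2 = 3, 8, 4, 3, 3
w1 = torch.randn(c1, c0, k1, k1)   # first conv's weights
w2 = torch.randn(c2, c1, k2, k2)   # second conv's weights

# Merged kernel: full 2-D convolution of w1 and w2, contracted over the
# shared channel dimension c1; its spatial size is k1 + k2 - 1.
merged = F.conv2d(w2, w1.permute(1, 0, 2, 3).flip(-1, -2), padding=k1 - 1)

x = torch.randn(1, c0, 32, 32)
y_two = F.conv2d(F.conv2d(x, w1), w2)   # two convs, identity in between
y_one = F.conv2d(x, merged)             # single equivalent conv
print(torch.allclose(y_two, y_one, atol=1e-3))  # True, up to float error
```

The fused layer trades a deeper pipeline for one wider kernel, which is exactly the kind of candidate the paper's dynamic program weighs against measured latency.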


How weak is YOUR password? Graphic shows exactly how long it would take hackers to break it

Daily Mail - Science & tech

As tedious as the incessant requests are for longer and harder-to-remember passwords, experts say there's good reason for the nuisance. It's gotten easier and easier for hackers to guess your password as computer processing speeds have gotten faster. With sprawling cloud-based computing power now available for rent to anyone, and massive supercomputers out there like the system that trained ChatGPT, cyber security firm Hive Systems says that a truly professional hacker could access your secrets almost instantly. The company has produced a new table showing just how safe or vulnerable your password is, based on its character count and the diversity of characters you've used. They say you'll need a fully random password, at least 12 characters long, with a mixture of numbers, special symbols, and upper- and lowercase letters, if you want to keep even just an amateur hacker out of your account, given the power of today's consumer desktop tech.
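The arithmetic behind such tables is simple: the number of candidate passwords is the charset size raised to the password length, and the worst-case crack time is that count divided by the attacker's guess rate. A quick Python sketch; the 10^12 guesses-per-second rate is an assumption for illustration, not Hive Systems' figure:

```python
# Back-of-the-envelope brute-force estimate: keyspace / guess rate.
def crack_time_years(length, charset_size, guesses_per_sec=1e12):
    keyspace = charset_size ** length        # all possible passwords
    seconds = keyspace / guesses_per_sec     # worst case: try every one
    return seconds / (365 * 24 * 3600)

# Lowercase letters only (26) vs. a full mix of lower/upper/digits/symbols (~94).
for n in (8, 12, 16):
    print(n, f"{crack_time_years(n, 26):.2e}", f"{crack_time_years(n, 94):.2e}")
```

The exponent is what matters: each extra character multiplies the keyspace by the charset size, which is why length plus character diversity beats either one alone.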


Top GPUs For Deep Learning and Machine Learning in 2022

#artificialintelligence

As we walk into the age of AI, there is an exponential rise in the demand for GPUs. GPUs apply the not-so-new technique of parallel computing to process computations, and with very high numbers of ALUs, or processing units, on board, they have become very well suited to the heavy computations of AI. Furthermore, with the advent of deep learning in the current decade, most deep learning frameworks, including the vastly popular TensorFlow, PyTorch, Theano, etc., enable advanced optimization of computations on the GPU. Currently, a vast number of GPUs are available, differing in features such as the number of processing units, memory capacity, and clock frequency.
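For readers wondering what enabling GPU computation looks like in practice, here is a minimal PyTorch example (the other frameworks named above expose similar device switches); the matrix sizes are arbitrary:

```python
# Run a matrix multiply on the GPU when one is available, else the CPU.
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
a = torch.randn(4096, 4096, device=device)
b = torch.randn(4096, 4096, device=device)
c = a @ b   # this multiply fans out across the GPU's parallel ALUs
print(c.device)
```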


Tested: 5 key things to know about Nvidia's GeForce RTX 3090

PCWorld

Nvidia's GeForce RTX 3090 delivers exhilarating graphics prowess: the fastest possible gaming frame rates at extreme resolutions, and outstanding performance in professional applications. At $1,500, it's either a hard pass or a no-brainer, depending on how you plan to use it. You can read every nitty-gritty detail in our comprehensive review of Nvidia's GeForce RTX 3090 Founders Edition. But if you don't feel like sifting through thousands of words of technical and testing details, here are the five key things you need to know. Yes, the GeForce RTX 3090 offers the "ultimate gaming experience" that Nvidia promised.


Nvidia GeForce RTX 3080 Founders Edition review: Staggeringly powerful

PCWorld

Nvidia's GeForce RTX 3080 graphics card symbolizes why we tell people to wait for the second generation when bleeding-edge technology appears. The radical new-look Turing GPUs inside Nvidia's GeForce RTX 20-series packed all sorts of cutting-edge technologies designed to usher in real-time ray tracing, a long sought-after goal for the gaming industry. Not only did Turing introduce specialized RT cores devoted to processing ray tracing tasks, it also debuted tensor cores, dedicated hardware that uses machine learning to help denoise ray traced visuals and enable AI-enhanced tools like the fantastic Deep Learning Super Sampling (DLSS) technology. Turing's improvements also extended to the traditional shader cores, introducing an overhauled processing pipeline better equipped to handle games built using the newer DirectX 12 and Vulkan graphics APIs. All of these were huge departures from the norm.